Journal of the College of Physicians and Surgeons Pakistan
ISSN: 1022-386X (PRINT)
ISSN: 1681-7168 (ONLINE)
Affiliations
doi: 10.29271/jcpsp.2025.05.574ABSTRACT
Objective: To outline immune-cell infiltration and identify diagnostic genes for atherosclerosis (AS) to better understand the potential molecular processes involved in AS development.
Study Design: Descriptive study.
Place and Duration of the Study: Department of Cardiology, The First Hospital of China Medical University, Shenyang, Liaoning, China, from 10th June to 8th October 2024.
Methodology: Relevant datasets were collected from the Gene Expression Omnibus database. Gene set enrichment analysis was conducted on differentially expressed genes (DEGs). Subsequently, three machine-learning algorithms were used to identify the core genes. Receiver operating characteristic (ROC) curves were used to analyse the clinical diagnostic value of the core genes.
Results: A Total of 3,307 DEGs, which were found primarily enriched in inflammation-related pathways. Further analysis of the core genes using three machine-learning algorithms revealed four intersecting genes, IBSP, PI16, MYOC, and IGLL5, which are all inflammation-related genes; they also showed good clinical diagnostic abilities, which were verified using ROC curves (area under the curve: 0.959, 0.946, 0.931, and 0.880, respectively).
Conclusion: IBSP, PI16, MYOC, and IGLL5 participate in AS pathogenesis by regulating inflammatory reactions. These are novel diagnostic markers and are expected to become potential targets for AS-targeted therapies.
Key Words: Atherosclerosis, Inflammatory reaction, Machine-learning algorithms, Bioinformatic
INTRODUCTION
The incidence rate of various cardiovascular and cerebrovascular diseases continues to rise, posing a great threat to the safety of patients.1,2 Atherosclerosis (AS) plays an important role in all these diseases.3,4 AS often involves the deposition of lipid plaques in the blood on the arterial wall. With the continuous release of inflammatory factors, the arterial wall is in a chronic inflammatory reaction state,5 ultimately leading to plaque rupture, thrombosis, and luminal stenosis, which in turn trigger a series of major adverse cardiovascular events (MACEs).6 Currently, the main treatment strategy for AS is the use of statins, which can lower low-density lipoprotein cholesterol levels.7 However, this therapy has not effectively reduced the occurrence of MACEs.8 Therefore, deciphering the pathogenic mechanism of AS can help guide and improve the effects of clinical diagnosis, treatment, and outcomes.
The concept of inflammation, which plays a key role in AS pathogenesis, has received increasing attention over the years, and many researchers have attempted to combine immune and anti-inflammatory treatments to reduce MACEs.9 For example, the anti-inflammatory medicine canakinumab, which is an interleukin-1β-specific antibody, can significantly reduce the occurrence of MACEs.10 The aim of this study was to outline immune-cell infiltration and identify diagnostic genes for AS, via machine-learning algorithms, to better understand the potential molecular processes involved in AS development.
METHODOLOGY
Using the GPL17077 platform of the gene expression omnibus (GEO) database; available from: (httpss://www.ncbi.nlm.nih. gov/geo/query/acc.cgi), GSE100927, which contains 69 AS tissues and 35 control tissues, was obtained. Inclusion criteria were AS patients with definite diagnoses and signed informed consent. Exclusion criteria were patients with non-AS peripheral artery disease, thrombosis or restenosis.
Differentially expressed genes (DEGs) were identified using the limma R package in R studio (version 4.1.1; R Foundation for Statistical Computing, Vienna, Austria) with a statistical threshold of logFC >1 and a false discovery rate of <0.05.
Gene set enrichment analysis (GSEA), available from: httpss:// www.broadinstitute.org/gsea was performed to identify DEGs and their enrichment functions between the two queues to determine the impact of synergistic gene changes on diseases within the gene set.11
Three machine-learning algorithms, least absolute shrinkage and selection operator (LASSO),12 support vector machine recursive feature elimination (SVM-RFE) analysis,13 and random forest (RF) analysis,14 were used to investigate the genes that play crucial roles in AS pathogenesis. The core genes were obtained by the intersection of the three algorithms.
Single-sample gene set enrichment analysis (ssGSEA) was based on 29 sets of immune genes to comprehensively evaluate the differences in immune characteristics between the two groups.15
The t-test was used to compare data between different groups. Statistical analysis and data visualisation were performed using the R Studio Software version 4.1.1, and GraphPad Prism version 8.0; GraphPad Software, Boston, MA, USA, respectively. Receiver operating characteristic (ROC) curves were drawn, and the area under the ROC curve (AUC) was calculated using the survival ROC R package. Spearman's rank correlation coefficient was used to evaluate the correlation between the two groups. Statistical significance was set at p <0.05.
RESULTS
GSE100927, which contains 69 AS tissues, was obtained (Figure 1A), and calibration was performed for subsequent analyses (Figure 1B). First, differential expression analysis of AS and control tissue data was conducted, demonstrating that 1,526 and 1,781 genes were up- and down- regulated, respectively (Figure 1C). The top 30 DEGs were displayed on a heat map (Figure 1D). Simultaneously, GSEA was performed to explore the functional differences between AS and control tissues. It was observed that DEGs were mainly enriched in inflammation-related pathways, including Th1 and Th2 cell differentiation, Toll-like receptor signalling pathway, and nucleotide-binding oligomerisation domain (NOD) such as receptor signalling pathway (Figure 1E). These results suggested that AS patho- genesis is closely related to inflammatory infiltration.
Three machine-learning algorithms were used to further investigate the genes that play crucial roles in this process (Figure 2A-C) and successfully intersected four genes: IBSP, PI16, MYOC, and IGLL5 (Figure 2D, Table I).
Subsequently, gene function analysis of these four core genes was predicted using GSEA, the results showed that IBSP was mainly associated with endocrine imbalance and abnormal fat metabolism (Figure 3A, B); PI16 was mainly associated with cholesterol and amino acid metabolism (Figure 3C, D); MYOC was mainly associated with lysosomes and cardiovascular disease (Figure 3E, F); IGLL5 was mainly associated with glucose and lipid metabolism and inflammatory reactions (Figure 3G, H). In conclusion, these functional abnormalities can contribute to the development of AS.
Nomogram analysis of IBSP, PI16, MYOC, and IGLL5 was conducted, all of which showed good diagnostic performances. In addition, ROC analysis was conducted on IBSP, PI16, MYOC, and IGLL5 to determine their potential as diagnostic biomarkers of AS (AUC: 0.959, 0.946, 0.931, and 0.880, respectively).
Based on these findings, the four identified core genes were all found to be related to inflammation. Therefore, ssGSEA was used to explore the specific relationships between the four genes and inflammatory cells and pathways. The result showed that they were significantly correlated with numerous immune-immersion-related pathways. Further- more, Spearman's rank correlation coefficient revealed that each individual gene was significantly associated with immune-inflammatory reaction-related pathways (Table II). These results indicate that inflammatory cells and factors play important roles in AS pathogenesis.
DISCUSSION
This study obtained and calibrated the AS data from a public database, identified the DEGs between the AS and control tissues, performed GSEA on DEGs, and detected that these genes were strongly linked to the inflammatory reaction pathways. The intersecting genes of three machine-learning algorithms were IBSP, PI16, MYOC, and IGLL5, suggesting they may be core genes in the occurrence and development of AS. The possible regulatory functions of these four genes revealed that they were all related to inflammatory reactions and meta- bolic abnormalities, which is consistent with AS pathogenesis.16,17 Current research indicates a causal relationship between IBSP, MYOC, and AS, but the specific regulatory mechanisms remain to be elucidated.18,19 Studies have shown that under high endo-thelial shear stress, PI16 inhibits protease activity, and thus plays a protective role during inflammation.20 Simultaneously, PI16 also participates in cholesterol transfer and affects AS development.21 Relatively few reports exist on IGLL5; currently, research mainly focuses on its role in haematological and autoimmune diseases.22,23 It is believed that with further research, the regulatory network between IGLL5 and AS will gradually be clarified.
Table I: The gene results of three machine-learning algorithms.
LASSO |
SVM-REF |
RF |
IBSP |
IGLL5 |
IBSP |
PI16 |
PI16 |
PI16 |
MYOC |
IBSP |
IGLL5 |
IGLL5 |
ACP5 |
CCL3 |
- |
MYOC |
ACP5 |
- |
PLA2G2A |
MYOC |
- |
CCL3 |
MMP9 |
- |
MMP9 |
PLA2G2A |
- |
SPP1 |
CHI3L1 |
- |
CHI3L1 |
SPP1 |
Figure 2: (A-C) The gene results of LASSO, SVM-RFE, and RF. (D) Venn diagram showed four genes, IBSP, PI16, MYOC, and IGLL5 were intersected.
Table II: The p-value of core genes in different pathways of Spearman's rank correlation.
Id |
IBSP |
IGLL5 |
MYOC |
PI16 |
||
Hallmark_xenobiotic_metabolism |
0.032 |
|||||
Hallmark_wnt_beta_catenin_signalling |
p <0.001 |
|||||
Hallmark_uv_response_up |
0.027 |
0.034 |
||||
Hallmark_uv_response_dn |
0.003 |
0.017 |
p <0.001 |
0.019 |
||
Hallmark_unfolded_protein_response |
0.032 |
0.036 |
||||
Hallmark_tnfa_signalling_via_nfkb |
||||||
Hallmark_tgf_beta_signalling |
0.006 |
p <0.001 |
||||
Hallmark_spermatogenesis |
0.006 |
|||||
Hallmark_reactive_oxygen_species_pathway |
0.004 |
|||||
Hallmark_protein_secretion |
0.008 |
|||||
Hallmark_pi3k_akt_mtor_signalling |
p <0.001 |
|||||
Hallmark_peroxisome |
0.005 |
|||||
Hallmark_pancreas_beta_cells |
||||||
Hallmark_p53_pathway |
||||||
Hallmark_oxidative_phosphorylation |
0.041 |
|||||
Hallmark_notch_signalling |
0.004 |
|||||
Hallmark_myogenesis |
0.021 |
0.028 |
p <0.001 |
0.003 |
||
Hallmark_myc_targets_v2 |
p <0.001 |
|||||
Hallmark_myc_targets_v1 |
0.041 |
0.003 |
||||
Hallmark_mtorc1_signalling |
p <0.001 |
|||||
Hallmark_mitotic_spindle |
||||||
Hallmark_kras_signalling_up |
||||||
Hallmark_kras_signalling_dn |
p <0.001 |
0.013 |
||||
Hallmark_interferon_gamma_response |
p <0.001 |
0.017 |
||||
Hallmark_interferon_alpha_response |
p <0.001 |
0.020 |
||||
Hallmark_inflammatory_response |
0.039 |
0.035 |
||||
Hallmark_il6_jak_stat3_signalling |
0.011 |
0.007 |
0.021 |
|||
Hallmark_il2_stat5_signalling |
0.041 |
|||||
Hallmark_hypoxia |
||||||
Hallmark_heme_metabolism |
0.006 |
|||||
Hallmark_hedgehog_signalling |
p <0.001 |
0.019 |
p <0.001 |
0.002 |
||
Hallmark_glycolysis |
p <0.001 |
|||||
Hallmark_g2m_checkpoint |
p <0.001 |
|||||
Hallmark_fatty_acid_metabolism |
0.032 |
|||||
Hallmark_estrogen_response_late |
p <0.001 |
p <0.001 |
||||
Hallmark_estrogen_response_early |
p <0.001 |
p <0.001 |
||||
Hallmark_epithelial_mesenchymal_transition |
0.042 |
0.031 |
0.007 |
|||
Hallmark_e2f_targets |
p <0.001 |
|||||
Hallmark_dna_repair |
p <0.001 |
|||||
Hallmark_complement |
0.020 |
0.002 |
p <0.001 |
|||
Hallmark_coagulation |
0.042 |
|||||
Hallmark_cholesterol_homeostasis |
p <0.001 |
|||||
Hallmark_bile_acid_metabolism |
0.008 |
|||||
Hallmark_apoptosis |
||||||
Hallmark_apical_surface |
p <0.001 |
0.013 |
||||
Hallmark_apical_junction |
0.013 |
p <0.001 |
0.004 |
|||
Hallmark_angiogenesis |
0.043 |
|||||
Hallmark_androgen_response |
||||||
Hallmark_allograft_rejection |
0.001 |
p <0.001 |
p <0.001 |
|||
Hallmark_adipogenesis |
0.027 |
Figure 3: Functional enrichment pathways of up- and down-regulation of IBSP (A-B); PI16 (C-D); MYOC E-F; IGLL5 (G-H), using GSEA.
ROC analysis was conducted to further explore the clinical translational value of IBSP, PI16, MYOC, IGLL5, and the AUC values of the four all exceeded (0.85), demonstrating good diagnostic performance. Finally, an immune infiltration analysis of the four genes was conducted, and it was found that both the integration and individual analyses showed a strong correlation with immune-related signalling pathways. This confirmed that IBSP, PI16, MYOC, and IGLL5 may participate in AS formation and progression by regulating inflammatory reactions.
This study has some limitations. Although various bio-informatics methods were utilised to identify potential AS-related regulatory genes, further basic research is needed to validate the results. IBSP, PI16, MYOC, and IGLL5 were identified as potential novel diagnostic markers for AS, however, large-sample clinical patient data is required to validate the findings.
CONCLUSION
IBSP, PI16, MYOC, and IGLL5 were found to participate in AS pathogenesis by regulating inflammatory reactions. Thus, they can be considered novel diagnostic markers for AS and are expected to become new targets for AS-targeted therapy.
ETHICAL APPROVAL:
The data for this study were acquired from public databases, did not involve the testing of human and animal samples. Therefore, ethical approval is not applicable.
PATIENTS’ CONSENT:
Informed consent was obtained from the participants included in this study.
COMPETING INTEREST:
The authors declared no conflict of interest.
AUTHORS’ CONTRIBUTION:
YC: Data acquisition, formal analysis, and original draft.
YS: Supervision, data validation, review, and revision of the work.
Both authors approved the final version of the manuscript to be published.
REFERENCES